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This research reports the results of a comprehensive investigation into the 
effectiveness of team building. The article serves to update and extend Salas, 
Rozell, Mullen, and Driskell’s (1999) team-building meta-analysis by assess¬ 
ing a larger database and examining a broader set of outcomes. Our study 
considers the impact of four specific team-building components (goal setting, 
interpersonal relations, problem solving, and role clarification) on cognitive, 
affective, process, and performance outcomes. Results (based on 60 correla¬ 
tions) suggest that team building has a positive moderate effect across all 
team outcomes. In terms of specific outcomes, team building was most 
strongly related to affective and process outcomes. Results are also presented 
on the differential effectiveness of team building based upon the team size. 

Keywords: team building; team performance; team development 


Teams of people working together for a common cause touch all our lives. 
From everyday activities like air travel, fire fighting, and running the United 
Way drive to amazing feats of human accomplishment like climbing Mt. 
Everest and reaching for the stars, teams are at the center of how work gets 
done in modern life. 

Kozlowski & Ilgen, 2006, p. 78 

This quote, from a recent review of work-team effectiveness, exemplifies 
the central role that teams play in our lives. Although many labels have 
been applied to team-based forms of organizing (i.e., crews, teams, groups, 
and collectives), these entities are essential to the accomplishment of orga¬ 
nizational goals. Indeed, there is ample support in the literature for the 
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contention that team-based forms of organizing are beneficial both to 
organizations and to individuals. For example, Applebaum and Batt (1994) 
reviewed 12 large-scale surveys and 185 case studies of managerial prac¬ 
tices and concluded that team-based work leads to improvements in orga¬ 
nizational performance on measures of both efficiency and quality. They 
argued that team-based systems benefit workers because of the higher like¬ 
lihood of job enhancement, autonomy, and skill development associated 
with these systems. However, the simple existence of a team-based orga¬ 
nizing structure is not enough to ensure that positive outcomes will result. 
Teams must be nurtured, supported, and developed. 


The Motivation for Understanding the 
Efficacy of Team Building 

There are three motivations for understanding the efficacy of team¬ 
building interventions in organizations. First, team building is one of the 
most commonly applied group development interventions in organizations 
today. It is widely used and comes in many forms, including outdoor expe¬ 
riential activities and indoor group process discussions. However, no one is 
quite sure how and why these interventions work, or if they even work at 
all. Considering the vast sum of money directed toward the development of 
teams in organizations, it is important that practitioners (and researchers) 
gain a better understanding of the effectiveness and boundary conditions of 
team building. 

Second, as there are many options available to organizations in the 
pursuit of improved teamwork, it is important to determine whether team 
building is a worthy choice. Some of these interventions are organizational 
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or structural in nature and do not specifically target team member inter¬ 
actions (e.g., job redesign, selection systems, and group incentive and 
performance management programs). In contrast, team-development inter¬ 
ventions—those whose quest is to directly impact the functioning and 
effectiveness of work teams—provide the focus for the current research 
integration. These interventions, when properly conducted, can have a 
positive impact on organizations. Consider a research study conducted by 
Macy and Izumi (1993), who analyzed 131 studies of organizational 
change. They found that interventions with the largest effects upon finan¬ 
cial measures of organizational performance were team-development 
interventions. That is, of all organizational interventions, those that focus 
on team development had the largest effect on measures of financial 
performance. 

Third, beyond financial performance, it is widely understood that team 
developmental interventions are key mechanisms that may be used to facil¬ 
itate team effectiveness (Noe, 2002). Therefore, it is important to under¬ 
stand how these interventions are most effective. At a general level, 
team-development interventions may include some form of team-training 
or team-building activities. Although both types of team-development inter¬ 
ventions are designed to improve team functioning and effectiveness (espe¬ 
cially when the science of individual and team training is utilized; Salas & 
Cannon-Bowers, 1997), team training and team building differ in important 
ways (Tannenbaum, Beard, & Salas, 1992). Team training is skill-focused 
(i.e., it is focused on gaining specific competencies), typically includes a 
practice component, and is done in context. It is generally formal and 
systematic. Team building, on the other hand, does not target skill-based 
competencies, is not systematic in nature, and is typically done in settings 
that do not approximate the actual performance environment. For our pur¬ 
poses, we define team building as a class of formal and informal team-level 
interventions that focus on improving social relations and clarifying roles, 
as well as solving task and interpersonal problems that affect team func¬ 
tioning. Team building works by assisting individuals and groups to exam¬ 
ine, diagnose, and act upon their behavior and interpersonal relationships 
(Schein, 1969, 1999). 

In light of the above, this article investigates the efficacy of team building. 
We begin with a brief review of conceptual issues and methodological issues 
in team building. Next, the hypotheses for this research are presented. 
Finally, the results of several meta-analytic integrations are discussed. First, 
however, is the rationale for the present research. 
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Understanding the Need for an Update 

It is an unfortunate indictment of the literature and practice in this area 
that we are still searching for answers to questions posed by Beer (1976) 
and Salas, Rozell, Mullen, and Driskell (1999). Namely, does team build¬ 
ing result in positive outcomes? Why? Under what conditions? Upon a 
careful review of the extent literature, it is clear that these questions need to 
be examined more closely to clarify our understanding of the effectiveness 
and boundary conditions of team building. Thus, this article reports the 
findings from a series of meta-analytic investigations of research on the 
efficacy of team-building interventions to update and extend the current 
state of knowledge in the team-building domain. In total, this research will 
examine three moderator variables in addition to making an overall assess¬ 
ment of the efficacy of team-building interventions. The current replication 
is meant to be systematic, rather than direct in nature (e.g., Aronson, 
Ellsworth, Carlsmith, & Gonzales, 1990). That is, rather than replicating 
the exact analyses from the study by Salas and colleagues, the current 
research will replicate and extend the earlier study in an effort to resolve 
ambiguities and provide new insights for organizational stakeholders and 
academicians alike concerning the effectiveness of team building. 

Although the empirical evidence on team-building interventions is lim¬ 
ited, a critical investigation of the available literature is warranted for sev¬ 
eral reasons. First, since the 1990s there has been an increasing incursion 
of team-building interventions in organizations. Some of the most recent 
trends in team building have taken these interventions into the kitchen and 
even the wilderness. Second, many practitioners feel that these interven¬ 
tions are useful. However, a more careful exploration of the utilities and 
strengths of these interventions would benefit practitioners now and would 
benefit the development of these interventions in the long run. Finally, there 
are enough data available to form estimates regarding some of the variables 
that may moderate the impact a team-building intervention may have on 
team outcomes. 

There is a general consensus for the idea that the science of team train¬ 
ing can be applied to enhance team functioning across organizations (e.g., 
Kozlowski & Ilgen, 2006; Salas & Cannon-Bowers, 1997). At the same 
time, reviews of team effectiveness have noted the inconsistent findings for 
the effectiveness of team-building interventions. Despite these inconsistent 
findings, researchers such as Kozlowski and Ilgen have acknowledged the 
potential that these interventions may have on shaping team development 
and on improving team effectiveness. Given the increasing frequency of 
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team-building interventions, if researchers do not catch up and start doing 
this research, an important window might be missed to constructively shape 
the important practice in this area. Thus, examining team building further— 
a topic that has not received recent sufficient attention in the literature—is 
critical. In the next section, a number of conceptual and methodological 
issues in team building are discussed. 


Team Building 

Conceptual Issues in Team Building 

Originally designed as a group process intervention (e.g., Schein, 1969, 
1999) for improving interpersonal relations and social interactions, team 
building has evolved to also include a concern for achieving results, meet¬ 
ing goals, and accomplishing tasks (Payne, 2001). In the late 1990s, Salas 
and colleagues (1999) described team-building interventions as extremely 
popular and common. According to Beer (1976), there are four basic 
approaches to team building, including: (a) a goal-setting, problem-solving 
model; (b) an interpersonal model; (c) a role model; and (d) the Managerial 
Grid (Blake & Mouton, 1964) model. However, this initial conceptualiza¬ 
tion has since been reconsidered. 

Refinements to this four-pronged system began with the Managerial Grid 
model being dropped as a distinct team-building approach. In addition to 
dropping the Managerial Grid model, modem conceptualizations have sep¬ 
arated the goal-setting and problem-solving approaches, using Buller’s 
(1986) problem-solving component as a distinct approach. As discussed by 
Buller, team-building models rarely exist in pure form. That is, the interven¬ 
tions reported in the literature usually involve elements from several or all of 
the models. As a result, he proposed a general problem-solving model that 
follows Dyer’s (1977) problem-solving framework. This model incorporates 
a focus on task or interpersonal issues, goal setting, and role clarification, 
depending on the nature of the specific problems identified for the group 
under investigation. Moreover, the problem-solving approach to team build¬ 
ing is said to subsume each of Beer’s (1976) components, and, as perhaps 
evident by its title, emphasizes the identification of major problems in the 
team. All told, these modifications have added clarity to the investigation 
and implementation of team building. Unfortunately, this newfound concep¬ 
tual clarity may not have come soon enough for many investigators who had 
previously sought to assess the efficacy of team building. 
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Perhaps because of the conceptual confusion that existed in this area, 
many reviews of team building did not include the same articles. For example 
two noteworthy reviews (DeMeuse & Liebowitz, 1981; Woodman & 
Sherwood, 1980) had only 14 studies in common out of a total of 66 studies 
reviewed. A subsequent review by Buller (1986) included only 9 studies, 3 in 
common with Woodman and Sherwood (1980) and 6 in common with 
DeMeuse and Liebowitz. One reason for the apparent lack of consistency in 
the articles chosen for reviews is that team building has been an ill-defined 
concept (Buller, 1986). At the time of these early reviews there was no 
agreed-upon operational definition of the intervention. Taken together, there 
is now a consensus position that there exist four distinct models of team 
building. Although combinations of these approaches are common, the 
models include goal-setting, developing interpersonal relations, clarifying 
roles, and creating additional capacity for problem solving (Beer, 1976; 
Buller, 1986; Dyer, 1987; Salas et al., 1999). Table 1 describes each of the 
four models or components of team building in more detail. 

Despite the recently established consensus on team-building compo¬ 
nents, there have been other problematic issues that have persisted for 
researchers and practitioners of this topic. Specifically, many early efforts 
were plagued by a number of methodological issues. A few of these are dis¬ 
cussed in the next section. 

Methodological Issues in Team Building 

Early reviews of team building described both a lack of extensive research 
on the issue and trepidation concerning the methodological rigor of published 
studies (Buller, 1986; DeMeuse & Liebowitz, 1981; Tannenbaum et al., 
1992; Woodman & Sherwood, 1980). That is, even if one could get beyond 
the disagreement concerning operational definitions of team building, 
much of the previous team-building research is characterized by method¬ 
ological flaws (i.e., study design and measurement issues). For example, 
Buller (1982) found that more than half of the reported team-building studies 
employed pre-experimental designs—designs that do not allow for causal 
inferences. Also problematic, there was often no attempt to disentangle the 
effects of team building from other interventions that may be in process 
within an organization. Although this can be a common problem for field 
research in general, it has been particularly problematic in the team-building 
domain. 

Tannenbaum and colleagues (1992) highlighted another methodological 
flaw in the team-building research. Specifically, they pointed out that there 
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Table 1 

Models/Components of Team Building 

Salas, Rozell, Mullen, Salas, Priest, & 

Component & Driskell, 1999 DeRouin, 2005 


Goal setting Emphasis: Setting objectives 
and development of 
individual and team goals. 

Team members: Become involved 
in action planning to identify 
ways to achieve goals. 


Interpersonal Emphasis: Increasing teamwork 
relations skills (i.e., mutual supportiveness, 

communication, and sharing 
of feelings). 

Team members: Develop trust in 
one another and confidence 
in the team. 


Role Emphasis: Increasing communication 

clarification among team members regarding 
their respective roles within 
the team. 

Team members: Improve 

understanding of their own and 
others’ respective roles and duties 
within the team. 


Problem Emphasis: Identifying 

solving major task-related problems 

within the team. 

Team members: Become involved 
in action planning, implement 
solutions to identify problems 


Designed to strengthen team member 
motivation to achieve team goals 
and objectives. 

By identifying specific outcome 
levels, teams can determine what 
future resources are needed. 

Individual characteristics (e.g., team 
member motivation) can also be 
altered by use of this intervention. 

Based on the assumption that teams 
with fewer interpersonal conflicts 
function more effectively than 
teams with greater numbers of 
interpersonal conflicts. 

Requires the use of a facilitator to 
develop mutual trust and open 
communication between team 
members. 

As team members achieve higher 
levels of trust, cooperation, and 
cohesiveness, team characteristics 
can be changed as well. 

Defines the team as comprising a set 
of overlapping roles. 

These overlapping roles are 

characterized as the behaviors that 
are expected of each individual 
team member. 

Can be used to improve team and 
individual characteristics (i.e., by 
reducing role ambiguity) and work 
structure by negotiating, defining, 
and adjusting team member roles. 

Buller’s (1986) problem-solving 
component subsumes aspects from 
all of the components described by 
Beer (1976). 

Team members practice setting goals, 
develop interpersonal relations, 
clarify team roles, and work to 

(continued) 
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Table 1 (continued) 


Salas, Rozell, Mullen, 

Salas, Priest, & 

Component 

& Driskell, 1999 

DeRouin, 2005 


and to evaluate those 

improve organizational 


solutions. 

characteristics through 
problem-solving tasks. 

Can have the added benefit of 
enhancing critical-thinking skills. 


has been a reliance on measuring the effectiveness of team-building inter¬ 
ventions with process measures. Although implicitly appealing, improve¬ 
ments in processes can not always be linked to improvements in team 
performance (e.g., Porras & Wilkens, 1980). For example, team perfor¬ 
mance is typically fashioned by additional environmental and/or organiza¬ 
tional characteristics and contingencies that are out of the volitional control 
of team members. Stated differently, team members are active participants 
in the enactment of team processes, but must also interact within the larger 
system to produce more distal performance outputs. Team processes may 
also be impacted by the larger organizational environment, but not likely to 
the same degree as team performance outputs. 

As a final methodological concern with previous team-building 
research, there has been an overreliance on subjective indicators of group 
or organizational performance criteria as dependent measures (Tannenbaum 
et al., 1992). Though this type of information may be interesting and rele¬ 
vant to measuring participant satisfaction and other affective outcomes that 
may be impacted by team-building interventions, it is often not concrete 
enough to allow for accurate predictions of the performance outcomes of 
team building. 

A Recent Advancement in Team-Building Research 

Despite the conceptual and methodological issues associated with eval¬ 
uations of team building, one recent effort has represented advancement 
over previous reviews by empirically investigating the effectiveness of 
these interventions. Specifically, Salas and colleagues (1999) responded to 
a call by Buller (1986) to use the primary focus of the intervention 
(i.e., goal setting, interpersonal relations, role clarification, and problem 
solving) as a potential moderating variable. The results of their study failed 
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to indicate a relationship between the combined set of team-building inter¬ 
ventions and team performance (r = .01, k = 16 effect sizes). Moreover, of 
the four components of team building, only role clarification proved to be 
effective, as judged by both objective (r = .71) and subjective (r = .75) 
accounts of performance. Interestingly, for objective and subjective mea¬ 
sures there was a nonsignificant effect of goal setting (r = -.06 and -.11, 
respectively), interpersonal relations (r = -.38 and -.04, respectively), and 
problem solving (r = -.31 and .09, respectively). Also examined in this 
study were the potential moderating influences of the source of the criterion 
measurement of performance (i.e., objective vs. subjective), team size, and 
training duration. For objective measures of performance, there was no evi¬ 
dence of a relationship between team building and performance (r = -.04, 
k = 8); for subjective measures, there was a small positive relationship 
between team building and performance (r = .14, k = 8). Concerning team 
size, the results of their study suggested that the effects of team building on 
performance decreased as a function of the size of the team (r = -.34). 
Finally, there was a slight tendency for the effects of team building to 
decrease as a function of the duration of the intervention (r = -.20). 

Taken together, this study enhanced our understanding of the efficacy of 
team building. However, there were a number of limitations associated with 
it—limitations that now necessitate the need for additional inquiry. For 
example, the amount of data analyzed in this study was relatively modest. 
Specifically, the research findings presented were based on only 16 effect 
sizes, and thus, it is difficult to discern, with any degree of confidence, the 
actual effectiveness of these interventions. Moreover, the findings derived 
from the Salas and colleagues’ (1999) study did not necessarily reflect the 
findings from existing narrative reviews. Finally, their study left many 
questions unanswered regarding the potential moderating impact of other 
relevant variables. As an organizing tool provided to summarize the litera¬ 
ture in this area, Table 2 provides a summary of five previous investigations 
into the efficacy of team building, and includes the number of articles 
reviewed, the years spanned, and other noteworthy features. 

In summary the current research was initiated to provide an updated 
meta-analysis of the team-building literature. Phrased in the form of ques¬ 
tions, an examination of these issues will help clarify our understanding of 
the effectiveness of team building and lead to hypothesized relationships 
involving the effectiveness of team building, including an investigation of 
specific moderators. A model that serves to graphically illustrate these 
hypotheses is presented in Figure 1. 
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Table 2 

Summary of Previous Team-Building Reviews 


Study 

Review Details 


Summary of 
Findings 


Noteworthy 

Features 


Woodman & 
Sherwood 
(1980) 


DeMeuse & 
Liebowitz 
(1981) 


Buller 

(1986) 


Tannenbaum, 
Beard, & 
Salas 
(1992) 


30 articles 

Qualitative 

data 

Years spanned: 
1964-1978 


36 articles 

Qualitative 

data 

Years spanned: 
1962-1980 


9 articles 

Qualitative 

data 

Years spanned: 
1964-1981 


17 articles 

Qualitative 

data 

Years spanned: 
1980-1988 


TB elicits positive affective 
reactions. 

The linkage between TB and 
work group performance 
remains largely unsubstantiated. 

TB is more commonly 

conducted with management 
teams than groups lower 
in organizational hierarchies. 

TB is more commonly conducted 
with intact and established 
work teams than new groups. 

Affective reactions as dependent 
measures are used more often 
than objective performance data. 

TB is described as having great 
promise for improving employee 
attitudes, perceptions, behaviors, 
and organizational effectiveness. 

Eighty-seven percent of the 68 

evaluations indicated positive results/ 

Due to the lack of rigorous research 
designs, firm conclusions 
concerning the effectiveness of these 
interventions could not be made. 

TB must be more carefully defined. 

More rigorous experimental 
designs should be used. 

There are numerous methodological 
flaws in previous TB studies. 

Therefore a clear assessment of the 
TB and task performance 
relationship had yet to emerge. 


The quantity of TB research decreased; 
however, the quality had improved. 

Most studies used multiple components 
in their TB interventions. 

More researchers began using 

behavioral and objective measures. 

Presented evidence to cast doubt the 
connection between process and 
performance; TB is effective, but 
for only perceptions and attitudes. 


Distinguished between team 
development and T-group 
or sensitivity training. 

Made subjective assessments 
of the internal validity 
of the studies reviewed. 


Coded studies according to 
research design, sample size, 
multiple dependent variables, 
and duration of the TB 
intervention. 


Presented and described a 
general problem solving 
approachto TB that added to 
Beer’s (1976) 
four-component model. 

Argued that Beer’s 

classification is difficult to 
use in practice because TB 
programs usually involve 
elements from each of the 
models. 

Presented a comprehensive 
model of team effectiveness 
that continues to influence 
research and theorizing 
in this field. 


( continued) 
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Table 2 (continued) 


Study 

Review Details 

Summary of 

Findings 

Noteworthy 

Features 

Salas, Rozell, 11 articles 

Post-intervention strategies may be a 
key mechanism to ensure the 
long-term effectiveness of these 
interventions. 

No significant effect of TB on 

First known attempt to 

Mullen, & 

performance. 

empirically summarize the 

Driskell Quantitative 

A nonsignificant tendency for TB to 

effectiveness of TB. 

(1999) data 

result in lower performance when 

Assessed a number of 

Years spanned: 

measured objectively, but increase 
performance when subjective 

moderators, including team 
building component, team 

1965-1990 

measures were used. 

size, training duration, and 


The role clarification component was 
more likely to increase performance. 15 
The effects of TB decreased as the size 

of the team increased. 

The effects of TB decreased as the 

duration of the intervention increased. 

type of performance measure 
used (i.e., objective 
versus subjective). 


Note: TB = team building. 

a. Many of the 36 studies reported evaluations of multiple dependent variables. 

b. This result should be interpreted with caution as it was based on a small number of effect sizes (it’s dif¬ 
ficult to determine, but likely only three or four effect sizes were used in this calculation). 


Hypotheses 

Is Team Building Effective? 

We agree with Salas and colleagues (1999) that previous narrative 
reviews have frequently expressed the benefits that can result from team 
building (e.g., Buller, 1986; DeMeuse & Liebowitz, 1981; Sundstrom, 
DeMeuse, & Futrell, 1990, Tannenbaum et al., 1992; Woodman & 
Sherwood, 1980); however, there has been a lack of definitive, compelling 
evidence concerning the positive effects of team building on team performance. 
Specifically, previous qualitative reviews of the team-building domain have 
concluded that evidence of an effect of team building on performance was 
“inconclusive” (Buller, 1986), “unsubstantiated” (Woodman & Sherwood, 
1980), “equivocal” (Tannenbaum et al., 1992), and “mixed” (Sundstrom 
et al., 1990). Meta-analytic results from one study have suggested there is 
no overall effect of team building on team performance (Salas et al., 1999). 
However, there was support for the role-clarification component of team 
building. Theoretically, one would assume that an intervention focused on 


Downloaded from sgr.sagepub.com at UNIV OF NEW ORLEANS on June 17, 2014 





12 Small Group Research 


Figure 1 

Theoretical Model Depicting Study Hypotheses 


Team - Building Components 


Goal Setting 


Interpersonal Relations 


Problem Solving 


Role Clarification 


Team Size 

• Small 

• Medium 

• Large 


Hi -H3 


Outcomes 


Cognitive 


Affective 


Process 


Performance 


improving team functioning would result in positive (as opposed to nega¬ 
tive) outcomes. In addition, the moderate support suggested by several 
narrative reviews (e.g., Tannenbaum et al.) leads us to predict a positive 
overall effect of team building on team functioning. Thus, we present our 
first hypothesis: 

Hypothesis 1: Team building interventions will result in enhanced team out¬ 
comes. 

The result from testing this hypothesis will provide for a baseline judg¬ 
ment concerning the efficacy of this particular form of team-development 
intervention. The omnibus test will therefore allow for an overall assess¬ 
ment of the efficacy of team building, with all independent outcomes from 
primary studies combined for this analysis. 

Is Team Building More Effective for Some 
Outcomes Than Others? 

This framing question asks whether the combined set of team-building 
interventions is shown to be more useful for improving certain team outcomes 
than others. Specifically, does team building work better for cognitive 
outcomes (e.g., declarative knowledge of teamwork competencies), team 
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member affective outcomes (e.g., trust, team potency), team processes 
(e.g., coordination, communication), or team performance outcomes (e.g., 
volume of sales, productivity measures)? This division of team-building 
outcomes is similar to the commonly discussed cognitive, affective, and 
skill-based breakdown of general training outcomes (e.g., Kraiger, Ford, & 
Salas, 1993). However, for the current research, skill-based outcomes are 
further divided into two additional categories—team processes and more 
performance-related or productivity-related outcomes. 

The division and examination of team-building effectiveness based on 
specific outcomes is intended to help clarify the often disparate results 
seen in the literature on these interventions. For example, Woodman and 
Sherwood’s (1980) qualitative review of team building concluded that it 
was only useful for facilitating affective outcomes, not team performance. 
Equally pejorative to the position that team building results in improved 
team performance was the conclusion provided by Buller (1986), who upon 
reviewing team-building studies conducted through 1980, suggested that 
the relationship between team building and performance was inconclusive. 
In yet another review that added to the opaque nature of the efficacy of team 
building, DeMeuse and Liebowitz (1981) accurately noted that most of the 
early research on team building relied almost exclusively on perceptual 
ratings of the dependent variables being studied. 

More recent reviewers of the efficacy of team building have reported 
somewhat different (i.e., more positive) conclusions. For example, the inte¬ 
gration of team-building research reported by Salas and colleagues (1999) 
has served, in many ways, as the focal point for the current research. And, 
although there was no significant overall effect of team building in their 
research, there was a small, yet significant, tendency for team building to 
increase performance when criteria were assessed with subjective measures. 
In another investigation into the efficacy of team building, Tannenbaum and 
colleagues (1992) reviewed team-building research conducted in the 1980s 
and found support for these interventions, especially when the outcome of 
interest was limited to team member perceptions or attitudes. Finally, 
Svyantek, Goodman, Benz, and Gard (1999) found that team building posi¬ 
tively impacted work-group productivity. Specifically, the largest impact of 
team building was on productivity measures of cost-effectiveness, with a 
smaller influence on quantity and quality measures. 

In general, previous research that has investigated the effectiveness of 
team building for improving specific outcomes has been equivocal at best, 
especially considering performance or productivity measures. At the same 
time, there has been perhaps the greatest support for the efficacy of team 
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building for improving affective or attitudinal outcomes. Finally, to our 
knowledge there have been no reviews citing the correlation between team¬ 
building interventions and improvements in cognitive outcomes. 

The mixed results found in the literature concerning the effect of team¬ 
building interventions on team outcomes lead us to believe that moderators 
may exist. Therefore, team building impacts certain outcomes more than 
others. Based on the general characteristics of team building, (e.g., that 
team building develops interpersonal relations, mutual trust, and open com¬ 
munication between team members), it is likely that team building will 
have a greater effect on affective outcomes than any on other type of out¬ 
comes. As a process intervention, it makes theoretical sense that team 
building would result in enhanced team member affective outcomes. 
Finally, although the findings of more distal reviews of team building were 
taken into consideration, a decision was made to place more credence on 
recent research, which more often reported positive findings. These find¬ 
ings, along with a complete consideration of the expected benefits of team 
building, led us to Hypotheses 2a and 2b: 

Hypothesis 2a: Team building interventions will result in improved outcomes 
across each of the four outcome types. 

Hypothesis 2b: Team building will be most effective for improving affective 
outcomes. 

Does the Focus of Team Building Moderate Its Effectiveness? 

It is certainly important to be able to identify whether team building 
results in enhanced team functioning, but it is perhaps more informative to 
know which forms or components of team building are most effective. 
Advancements in theory in the last quarter century (e.g., Buller, 1986; Salas 
et al., 1999; Salas, Priest, & DeRouin, 2005) have allowed for the parcel¬ 
ing of team-building interventions into four distinct foci (see Table 1). 
Unfortunately, the single existing empirical integration on the relative effi¬ 
cacy of different components of team building (i.e., Salas et al.) did not 
provide encouraging results. Combining subjective and objective estimates, 
there was a slight (nonsignificant) tendency for goal-setting (r = -.16), 
interpersonal relations (r = -.06), and problem-solving (r = -.05) models 
of team building to result in decreased performance. It was only the 
role-clarification component that appeared to be effective for improving 
performance (r = .76). However, upon examination of their data set, it 
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appears this positive finding for role clarification was supported by the 
results from only three studies and three effect sizes. Thus, caution and further 
investigation are warranted before concluding that the role-clarification 
component of team building is superior to the other forms. 

Nonetheless, because the role-clarification component of team building 
is designed to relieve role stress as created by role ambiguity or role con¬ 
flict, it is reasonable to anticipate that a significant improvement in team 
functioning should result. In addition, the role-clarification component of 
team building emphasizes communication among team members, and 
thus it is likely that an increase in the level and quality of communication 
between team members will impact their effectiveness. Unlike the other 
forms of team building, improvements in role clarity and communication 
are expected to produce more lasting benefits in terms of team function¬ 
ing (as assessed through an analysis of the combined set of team outcomes). 
Combining this theoretical rationale with the preliminary findings of 
Salas and colleagues (1999) concerning the role-clarification component 
of team building, it is our belief that a team-building intervention that 
utilizes a role-clarification focus will provide the most benefit to team 
functioning. At the same time, it is our view that any carefully thought out 
team-building intervention should have at least some positive impact on 
team members. Thus, we also expect that team building that focuses on 
interpersonal relations, goal setting, or problem solving will also prove 
useful, at least for the short-term benefits that are typically assessed. 
Taking these dual considerations into perspective, we present our next set 
of hypotheses: 

Hypothesis 3a: Each of the four components of team building will demon¬ 
strate a moderate level of effectiveness for improving team functioning. 

Hypothesis 3b: The role clarification component of team building will be 
most effective for improving team functioning. 

Is the Effectiveness of Team Building 
Moderated by Team Size? 

This research will also investigate whether the effectiveness of team 
building is moderated by the size of the team. At a basic level, “the resources 
available on a team result from how many people are on it” (Hambrick & 
D’Aveni, 1992, p. 1449). Sundstrom, McIntyre, Halfhill, and Richards’s 
(2000) review of 83 field studies and experiments conducted with work 
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groups suggested that the average group size was 11 members. Moreover, 
Halebian and Finkelstein (1993) have suggested that team size is synony¬ 
mous with cognitive capability. Providing support for this assertion, Bantel 
and Jackson (1989) found that larger teams generally have a greater reservoir 
of cognitive resources than smaller teams. There is little doubt that there are 
benefits to having medium- to large-sized teams available to perform work 
tasks rather than smaller teams. However, research has also found that larger 
teams can facilitate the enactment of other, less desirable, group phenome¬ 
non, including the participation leadership effect, the in-group bias effect, the 
cohesiveness performance effect, and the well-known groupthink effect (cf. 
Mullen, Anthony, Salas, & Driskell, 1994; Mullen, Brown, & Smith, 1992; 
Mullen & Copper, 1994; Mullen, Salas, & Driskell, 1989). 

From other research, we know that information pooling is critical to team- 
based decision making. That is, the size of a team may impact the effectiveness 
with which a team pools common and unique information. For example, Stasser, 
Vaughan, and Stewart (2000) discussed the tendency for group members to dis¬ 
cuss shared information rather than the unique information that is held by team 
members—an issue that becomes more problematic as teams increase in size. 
This lack of attention to unique information held by individual team members 
can lead to mal-informed decisions and occasionally even detrimental results. 

Salas and colleagues (1999) presented the only existing research integration 
that has assessed the relative efficacy of team building for teams of different 
sizes. They found that the effects of team building on performance decreased 
as a function of the size of the team, both in objective measures and in subjec¬ 
tive measures of performance. These authors concluded that any positive effect 
of team building is most likely to prevail only in small teams. Similarly, others 
have argued that as group size increases, members’ liking for the group (Indik, 
1965) and performance (Mullen, 1987) tend to decrease. Taking these findings 
into consideration, the current research examined team size as a moderator of 
the effectiveness of team-building efforts on team performance. In this study, 
we seek to replicate Salas and colleagues’ findings. However, rather than sim¬ 
ply correlating the database of effect sizes with their associated team sizes, the 
current research will examine the efficacy of team building for three distinct 
subgroup classifications of team size: small teams (i.e., less than 5 members); 
medium-sized teams (i.e., 5 to 10 members); and large teams (i.e., greater than 
10 members). In addition, although there is some reason to expect that larger 
teams have an increased cognitive capacity with which to perform tasks (e.g., 
Bantel & Jackson, 1989), the other negative issues often associated with 
increased group size are expected to be the overwhelming influences on the 
performance of teams. Therefore, it is reasonable to suggest that larger teams 


Downloaded from sgr.sagepub.com at UNIV OF NEW ORLEANS on June 17, 2014 


Klein et al. / Team Building 17 


are already performing at a lower level than medium or small teams, and would 
therefore exhibit enhanced benefits from team-building interventions. Stated 
differently, as teams increase in size, it is more likely they will show substan¬ 
tial benefits from team building. This expectation, although supported by 
theory, is in direct contradiction to the findings reported by Salas and col¬ 
leagues. The following hypothesis is proposed to investigate this assertion. 

Hypothesis 4: Large teams will show greater benefits from team building 
than small- or medium-sized teams. 


Method 


Literature Search 

A comprehensive literature review was conducted to identify published 
and unpublished studies relevant to the effects of team building on the four 
team outcomes. As a starting point for our literature review, we conducted 
an online search of the Defense Technical Information Center (DTIC), 
Google Scholar ©, and the most common academic search databases 
(e.g., PsycINFO). The particular keywords that were used included, but were 
not limited to, the following terms: team building, team development, team 
goal setting, interpersonal relations, problem solving, role clarification, group 
building, and group development. In addition to these search techniques, key 
articles relevant to team building were inspected by hand for additional, 
potentially useful primary studies (e.g., Buller, 1986; DeMeuse & Liebowitz, 
1981; Salas et al., 1999; Tannenbaum et al., 1992; Woodman & Sherwood, 
1980). This ancestry approach was later extended to each of the articles that 
were included in the database in an effort to ensure that no fugitive studies 
had been overlooked. In the end, the literature search process resulted in 103 
articles being identified for potential inclusion in the database. 

Criteria for inclusion. Specific limits were placed on the search to include 
articles published from 1950 to 2007. This date range was selected because 
previous reviews had not uncovered any published or unpublished team¬ 
building evaluations prior to 1950. Moreover, studies were included only if 
they involved adult, human, and nonclinical populations, and focused on 
teams and not groups. Finally, studies had to report data sufficient to calcu¬ 
late an effect size (r) assessing the relationship between team-building inter¬ 
ventions and outcomes. 
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Coding Procedure 

Following the literature review, three authors independently coded each of 
the relevant studies on 14 categories: (a) nature of the organization and par¬ 
ticipant sample; (b) team type; (c) number of teams; (d) average team size; 
(e) predictor reliability; (f) level of analysis of predictor; (g) criterion relia¬ 
bility; (h) level of analysis of the criterion; (i) criterion report type(s); (j) cri¬ 
terion description(s); (k) focus/component of team building (i.e., goal setting, 
interpersonal relations, problem solving, role clarification); (1) effect size(s); 
(m) study design type; and (n) recommendation for inclusion. Concerning the 
level of analysis coding, the initial database consisted of 69 effect sizes—60 
at the team level and 9 at the individual level. However, mixing levels of 
analysis is not recommended for research integrations for numerous reasons 
(e.g., Gully, Devine, & Whitney, 1995; Hunter & Schmidt, 1990). Therefore, 
only the team-level outcomes were analyzed in the current research, reducing 
our database to the 60 team-level correlations. 

Another point concerning the coding process that bears clarification is the 
assessment of the focus or component of team building. It was expected that 
many of the interventions being examined would consist of multiple compo¬ 
nents of team building. Thus, coders were instructed to allocate a total of 
100 percentage points to each of the four components of team building. For 
example, if a coder believed an intervention consisted of equal parts role 
clarification and interpersonal relations, each was coded with 50 percentage 
points, whereas problem solving and goal setting were given a 0 for this cod¬ 
ing category. For this process, the coders were instructed to closely examine 
the effort, intensity, and primary focus of the intervention. They were 
instructed further that they would need to justify the weights they assigned 
during a consensus discussion of the final articles included in the database. In 
this discussion, the three coders came to a consensus concerning the alloca¬ 
tion of percentage points to each team-building intervention. 

Rater reliability. Articles were coded by one of three of the authors. To 
aid in the consistency of coding, the three authors met early in the process 
to discuss the 14 pieces of information that were to be extracted from each of 
the 103 articles. Moreover, the coders had previously pilot tested the coding 
scheme, and possessed satisfactory level of expertise in the substantive 
areas being coded. To estimate the reliability with which the coders were 
evaluating the primary studies, three authors independently coded the same 
set of 20 articles (the reliability sample, representing approximately 19% of 
the total 103 articles selected for coding) to determine interrater reliability 
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by calculating intraclass correlation coefficients (ICCs; Nunnally, 1978; 
Shrout & Fleiss, 1979). The coders then each independently coded approxi¬ 
mately one third of the remaining articles. 

During the coding process, the decision to include or exclude the article for 
the final analyses was of primary concern. Upon inspection of the 20 studies 
coded by each of the three coders, it was revealed that all three coders were in 
agreement 85% of the time with their global evaluation to include (or not to 
include) particular studies. The ICC for this assessment was satisfactory 
(3, k) - .85. Importantly, for the three studies for which there was not complete 
initial agreement the authors met to discuss the discrepancies and a consensus 
was reached. The ICCs calculated for seven of the categories ultimately used 
in either main or exploratory analyses ranged from ICC (3, k) = .85 to ICC (3, 
k) = 1.0. In addition, the agreement among the three coders across all seven 
categories resulted in an ICC (3, k) = .96. The remaining six categories that 
were utilized for the ICC calculations included: team type, number of teams 
in the primary studies under investigation, average team size, focus/compo¬ 
nent of the team-building intervention, effect size estimate, and study design. 
The ICCs for these categories were .94, .94, .97, .94, .91, and 1.0, respectively. 
In conclusion, the consistency and agreement among the raters was very good. 

Meta-Analysis Procedure 

The software for the Hunter-Schmidt meta-analysis methods was used to 
analyze the data (Schmidt & Le, 2005). This program provides output that 
includes, but is not limited to: (a) the mean true score correlation, (b) the vari¬ 
ance and standard deviation of true score correlations, (c) credibility intervals, 
(d) estimates of the variance and standard deviation in observed correlations 
due to artifacts, and (e) the percentage of variance attributable to observed cor¬ 
relations after the removal of artifacts. In addition, the software employs a ran¬ 
dom effects model to combine effect sizes from primary studies. Random 
effects meta-analysis models allow the true effect sizes to vary, in contrast with 
fixed-effects models that assume the true effect sizes have fixed values. 
Moreover, the random effects model is considered to be “more realistic than 
the fixed-effect model on the majority of occasions” (Field, 2001, p. 162). 

Effect-Size Calculations 

Before calculating a meta-analytic estimate of the relationship between 
variables, effect sizes culled from primary studies must be prepared for 
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entry into the database. During this process, effect sizes from primary stud¬ 
ies were converted to the common metric r (correlation). Thus, when nec¬ 
essary, primary study effect sizes reported as other statistics (e.g., t , F, d , % 2 , 
or Z) were transformed using the formulas found in Hunter and Schmidt 
(2004). Once placed on this common metric of effect size, the results of 
independent tests found in primary studies can be combined and assessed 
for fit with predicted hypotheses in the meta-analysis. However, it is 
common knowledge that when a study contains multiple effect sizes, they 
are stochastically dependent (Shadish, Cook, & Campbell, 2002). These 
dependencies violate the statistical assumption of independent effect 
sizes. The recommended solution is to average or combine effect sizes 
from different measures of the same sample within a single study prior to 
combining results from multiple studies. 

Corrections for Unreliability and Range Restriction 

It is common in meta-analyses to make attempts to correct obtained 
reliability coefficients for measures of the predictor, criterion, or both (e.g., 
Hunter & Schmidt, 1990; Johnson, Mullen, & Salas, 1995). Unfortunately, 
original studies often fail to report all of the auxiliary information necessary 
to perform corrections for study artifacts. Such was the case with the cur¬ 
rent database. As a result, artifact distribution meta-analysis was employed, 
rather than correcting each effect size individually for unreliability. 

Authors of meta-analytic integrations also make attempts to correct for 
the effects of direct or indirect range restriction. Performing corrections for 
range restriction will generally result in a combined estimate that is more 
accurate than had no corrections been performed at all (e.g., Hunter, 
Schmidt, & Le, 2006). However, in the current investigation, primary 
studies did not report information on restricted and unrestricted samples 
that would be necessary to perform corrections for range restriction. We 
therefore cannot correct for range restriction in the current meta-analysis. 
Consequently, our effect-size estimates may be conservative, in the sense 
that they may underestimate the true effect of team building if the biasing 
effects of range restrictions could be corrected. 

Weighting 

Primary study effect sizes included in the current series of meta-analytic 
integrations were weighted by their sample sizes. It is presumed that effect 
sizes obtained from studies with larger sample sizes are more stable (i.e., 
accurate) than those culled from studies with small sample sizes. Although 
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it is important to remember that large sample studies are not necessarily 
more valid, recent simulation studies have determined that a very accurate 
combined effect size can be obtained through procedures used to weight 
primary studies by sample size (e.g., Field, 2005). 

Results 


Description of the Database 

A total of 60 correlations were obtained from 20 studies. These 60 effect 
sizes represented 1,562 teams, with a median team size of approximately 9 
members. Although these 60 effect sizes were not all from independent 
samples, every subgroup analysis that was performed included only inde¬ 
pendent samples. Otherwise, for the overall assessment of the influence of 
team building on the combined set of team outcomes, there were 26 inde¬ 
pendent samples (see Table 3). Of the studies included in the meta-analytic 
database, 14 were published and 6 were unpublished. Table 3 provides a 
description of the key information derived from each primary study. 

Publication/Availability Bias Detection 

When performing any meta-analysis it is difficult to determine whether 
all relevant studies have been located. Existing empirical evaluations that 
are elusive to locate and retrieve are commonly referred to as fugitive 
literature (Rosenthal, 1994). To the extent that this fugitive literature repre¬ 
sents a substantial proportion of the conducted evaluations in any one area, 
there is a possibility of publication or availability bias. The file drawer 
problem in meta-analyses presents itself when there is a concern that the 
studies that find their way into publication are simply the ones that show 
significant results, whereas evaluations with nonsignificant findings are rel¬ 
egated to file drawers (Rosenthal, 1979). To alleviate this concern, 
Rosenthal’s (1979) file drawer analysis can be used to provide an estimate 
of the number of unpublished studies relegated to file drawers. This analy¬ 
sis designates the number of these articles, which average null results, that 
would be required to bring the significance level for a set of studies down 
to the just-significant level (Hunter & Schmidt, 2004). However, it has been 
demonstrated that the widely used fail-safe file drawer analysis is essen¬ 
tially irrelevant and can result in incorrect estimates of the size of the file 
drawer (Scargle, 2000). Instead, funnel plots can serve as an alternative to 
file drawer analysis for the purpose of detecting the possibility of bias. 
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Funnel plot. The funnel plot (Light & Pillemer, 1984) is a simple visual 
tool (i.e., scatterplot) for detecting the presence of publication or other 
availability bias in meta-analysis. To apply this technique, effect sizes are 
graphed on the horizontal axis, whereas study sample sizes (N) are graphed 
on the vertical axis. The idea is that, in the absence of bias, the results of 
small sample studies will scatter widely at the bottom of the graph, with the 
spread narrowing for larger N studies; thus taking the form of an inverted 
funnel (Hunter & Schmidt, 2004). If publication bias is present in the data, 
the figure will be asymmetrical and often truncated in the lower left-hand 
portion of the scatterplot. As seen in Figure 2, the data from primary stud¬ 
ies included in the current meta-analysis form a somewhat regular funnel. 
Thus, it appears that the small sample studies are spread rather evenly 
across the effect-size continuum, making the possibility of publication or 
availability bias less of a concern in this research. 

Meta-Analytic Results 

The meta-analytic results for the hypotheses investigated in this research 
are presented in Tables 4 through 6. These tables show a number of pieces 
of information, including: the number of teams in each analysis (N)\ the 
number of independent effect sizes (correlations) in each analysis (k); the 
mean weighted observed correlation (r); the 80% confidence interval for 
that correlation; the estimated true score correlation (p); the standard 
deviation of this true score correlation ( SD p ); the 80% credibility interval 
(10% CV and 90% CV); and the percentage of variance accounted for by 
statistical artifacts. Confidence and credibility intervals are useful as aids in 
providing the best estimate of the true nature of the relationships between 
two variables (Whitener, 1990). Confidence intervals are applied to observed 
scores, center on a single mean score, and reflect the effects of sampling 
error. Credibility intervals, on the other hand, are particularly meaningful 
because they take into account information about the distribution of effect 
sizes after other research artifacts have been taken out. Credibility intervals 
can also be useful for determining whether moderators are operating 
(Whitener, 1990). Similarly, the percentage of variance estimate provides 
information concerning the variance in observed correlations that is the 
result of statistical artifacts. In general, the higher the percentage, the more 
certain we can be that additional moderators are not operating. The following 
sections more closely examine the results for the individual hypotheses that 
were put forward in this research. 
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Figure 2 

Funnel Plot for Detecting the Possibility of 
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Team-building effectiveness. The omnibus test representing all team¬ 
building interventions and outcomes resulted in a significant tendency for 
these interventions to improve team outcomes. Specifically, the mean true 
score correlation of .31 represents a moderate effect (10% CV = .11; 90% 
CV = .52) and provided support for Hypothesis 1. This analysis included 26 
independent effect sizes and was based on a total sample size of 579 teams. 
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Table 4 

Analysis of the Effectiveness of Team Building 
Based Upon Outcome Type 


Outcome 

Type 

N 

k 

r 

Cl r 

10% 

Cl r 

90% 

P 

sd; 

10% 

CV 

90% 

CV 

% Var. 

Acct. c 

Cognitive 

71 

3 

.11 

.06 

.17 

.13 

.00 

.13 

.13 

1,394.24 

Affective 

482 

19 

.41 

.34 

.49 

.44 

.18 

.21 

.66 

55.42 

Process 

485 

20 

.39 

.31 

.46 

.44 

.20 

.18 

.69 

54.04 

Performance 

524 

18 

.25 

.16 

.33 

.26 

.23 

-.03 

.55 

45.11 

All 

outcomes 

579 

26 

.28 

.22 

.29 

.31 

.16 

.11 

.52 

69.35 

All 

outcomes 21 

1,562 

60 

.34 

.29 

.38 

.37 

.21 

.10 

.64 

50.64 


Note: k = number of correlations coefficients on which each distribution was based; r= mean 
observed correlation; Cl. 10% = lower bound of the confidence interval for observed r; CI r 
90% = upper bound of the confidence interval for observed r; p = estimated true correlation 
between the predictor construct and the relevant criterion (fully corrected for measurement 
error in both the predictor and the criterion); SD p = estimated standard deviation of the true 
correlation; 10%CV = lower bound of the credibility interval for each distribution; 90% CV = 
upper bound of the credibility interval for each distribution; % var. acct. = percentage of 
observed variance accounted for by statistical artifacts. 

a. This result represents the entire database and patently violates the assumption of indepen¬ 
dent effect sizes as many studies contributed more than one effect size. It is shown here for 
illustrative purposes only. 

b. The SD p being zero indicates that the real variance of the true correlation is zero—there is 
only one value of the true correlation underlying all the studies. It is a consequence of the 
percentage of variance accounted for estimate being greater than 100%. This result also indi¬ 
cates that there should be no additional moderators operating for this analysis. 

c. The percentage of variance explained estimate being greater than the theoretical maximum 
value of 100% indicates that sampling error and other study artifacts explain all of the 
observed variation in the effect sizes (correlations) across studies. The estimated value is 
greater than 100% because of second-order sampling error. 


Here, cognitive, affective, process, and performance outcomes were aver¬ 
aged for each primary study before figuring into the combined estimate. The 
only exception was for six of the primary studies that reported the results of 
team-building interventions for separate samples within the same study. For 
these instances only, there were multiple correlations from single studies 
that contributed to the combined estimate. However, care was taken to 
ensure the assumption of independent effect sizes was not violated. 

Investigation of separate outcomes. From the overall database of 60 effect 
sizes, this research also assessed the impact of team building on distinct 
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Table 5 


Analysis of the Effectiveness of Team Building Based Upon 
Team-Building Component 


Team-Building 

Component 

N 

k 

r 

Cl r 

10% 

CI r 

90% 

P 

sd; 

10% 

CV 

90% 

CV 

% Var. 
Acct. b 

Goal setting 

258 

10 

.34 

.21 

.47 

.37 

21 

.02 

.71 

37.65 

Interpersonal 

relations 

140 

13 

.23 

.15 

.31 

.26 

.00 

.26 

.26 

233.50 

Problem solving 

326 

11 

.23 

.16 

.29 

.24 

.00 

.24 

.24 

114.27 

Role clarification 

54 

5 

.32 

.22 

.42 

.35 

.00 

.35 

.35 

322.06 


Note, k = number of correlations coefficients on which each distribution was based; f- mean 
observed correlation; Cl. 10% = lower bound of the confidence interval for observed r; CI r 
90% = upper bound of the confidence interval for observed r; p = estimated true correlation 
between the predictor construct and the relevant criterion (fully corrected for measurement 
error in both the predictor and the criterion); SD p = estimated standard deviation of the true 
correlation; 10% CV = lower bound of the credibility interval for each distribution; 90% CV 
= upper bound of the credibility interval for each distribution; % var. acct. = percentage of 
observed variance accounted for by statistical artifacts. 

a. The SD p being zero indicates that the real variance of the true correlation is zero—there is 
only one value of the true correlation underlying all the studies. It is a consequence of the per¬ 
centage of variance accounted for estimate being greater than 100%. This result also indicates 
that there should be no additional moderators operating for this analysis. 

b. The percentage of variance explained estimate being greater than the theoretical maximum 
value of 100% indicates that sampling error and other study artifacts explain all of the 
observed variation in the effect sizes (correlations) across studies. The estimated value is 
greater than 100% because of second-order sampling error. 


outcomes. Specifically, we evaluated the impact team building had on cogni¬ 
tive, affective, process, and performance outcomes. The number of effect 
sizes for these subgroup analyses ranged from 3 to 20. Table 4 presents the 
results of these investigations. 

In evaluating the impact of team building on cognitive outcomes, three 
effect sizes were analyzed (N = 71 teams). The estimated true score corre¬ 
lation (p) for the relationship between team building and improvements in 
cognitive outcomes was .13. Although it appears that team building has a 
negligible impact on these outcomes, this result should be interpreted with 
extreme caution since there were only three effect sizes available to com¬ 
bine. For the analysis of the influence of team building on affective out¬ 
comes, the estimated true score correlation was .44 (10% CV = .21; 90% 
CV = .66). This result represented the accumulation of data from 19 effect 
sizes and a total of 482 teams. Concerning process outcomes, the results 
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Table 6 

Analysis of the Effectiveness of Team Building Based Upon Team Size 


Team 

Size 

N 

k 

r 

Cl 

10% 

Cl r 

90% 

P 

sd; 

10% 

CV 

90% 

CV 

% Var. 
Acct. b 

Small 

178 

7 

.26 

.15 

.37 

.28 

.12 

.12 

.44 

74.22 

Medium 

340 

10 

.25 

.14 

.36 

.27 

.21 

.00 

.54 

45.72 

Large 

61 

9 

.54 

.44 

.64 

.66 

.00 

.66 

.66 

197.54 


Note: k = number of correlations coefficients on which each distribution was based; r= mean 
observed correlation; CI r 10% = lower bound of the confidence interval for observed r; CI r 
90% = upper bound of the confidence interval for observed r; p = estimated true correlation 
between the predictor construct and the relevant criterion (fully corrected for measurement 
error in both the predictor and criterion); SD p = estimated standard deviation of the true cor¬ 
relation; 10% CV = lower bound of the credibility interval for each distribution; 90% CV = 
upper bound of the credibility interval for each distribution; % var. acct. = percentage of 
observed variance accounted for by statistical artifacts. 

a. The SD p being zero indicates that the real variance of the true correlation is zero—there is 
only one value of the true correlation underlying all the studies. It is a consequence of the per¬ 
centage of variance accounted for estimate being greater than 100%. This result also indicates 
that there should be no additional moderators operating for this analysis. 

b. The percentage of variance explained estimate being greater than the theoretical maximum 
value of 100% indicates that sampling error and other study artifacts explain all of the 
observed variation in the effect sizes (correlations) across studies. The estimated value is 
greater than 100% because of second-order sampling error. 


suggested an estimated true score correlation of .44 (k = 20 correlations; 
N = 485 teams). The 80% credibility interval for this finding ranged from 
.18 to .69. Finally, for the analysis on performance outcomes, the results 
indicated an estimated true score correlation of .26 (k = 18; N = 52). 

All told, the results of the analyses focused on the influence of team 
building on separate outcomes provided moderate support for Hypothesis 2a. 
Specifically, team building appeared to be effective for improving each of 
the four outcomes. However, it should be pointed out that the credibility 
interval for performance outcomes barely included 0 (10% CV = -.03; 90% 
CV = .55). At the same time, Hypothesis 2b received only partial support. 
Although it was posited that team building would be most effective for 
improving affective outcomes, the results from this study suggested that 
team building was slightly more effective for improving process outcomes 
(p = .439 vs. .437). However, given the considerable overlap in both confi¬ 
dence and credibility intervals for these analyses (see Table 4), it is difficult 
to determine with any certainty which outcome type is most greatly affected 
by team-building interventions. Moreover, team building did appear to be 
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more effective for improving affective outcomes than either cognitive or 
performance outcomes (ps = .13 and .26, respectively). 

Focus of team building. A total of 39 correlations were meta-analyzed to 
assess the differential impact of various team-building components. For this 
analysis, the interventions described in primary studies were closely scruti¬ 
nized to determine which component(s) of team building was the major 
focus. In some instances it was determined that the intervention had a 
multiple focus on two or more separate components. Therefore, in accu¬ 
mulating the study results, correlations from primary studies were some¬ 
times included in multiple categories for the purposes of analysis so that 
there was some degree of overlap regarding the analysis of team-building 
components. Table 5 presents the results of these analyses. 

Ten effect sizes, representing 258 teams, were meta-analyzed for the 
investigation into the efficacy of the goal-setting component of team building. 
The estimated true score correlation was .37, indicating a moderate effect on 
the combined set of team outcomes. The remaining team-building compo¬ 
nents, namely, interpersonal relations, problem solving, and role clarification, 
also resulted in moderate effect sizes. The estimated true mean score cor¬ 
relations were .26 (k =13 ,N = 140), .24 (k = 11, N = 326), and .35 (k = 5, 
N = 54), respectively. The credibility intervals for these analyses were also 
calculated (see Table 5). All calculated intervals of the true mean score cor¬ 
relation did not include 0 (the 10% CV > .00 for all cases), thus indicating 
that the true mean score correlations are positive in most of their populations. 
In other words, the effects of team building based on all the components 
examined generalize across most situations and settings. The results of this 
analysis confirmed Hypothesis 3a, as each of the team-building components 
resulted in improved team functioning. However, there was only partial sup¬ 
port for Hypothesis 3b. Specifically, though the role-clarification component 
of team building (p = .35) appeared to be superior to either the interpersonal 
relations (p = .26) or problem-solving (p = .24) components, the goal-setting 
component (p = .37) appeared to work best of all. 

Team size. Twenty-six effect sizes were used to assess the potential mod¬ 
erating influence of team size on the ability of team building to improve 
team functioning. The results for this analysis are presented in Table 6. 
Effect sizes were categorized into three groups based on the average team 
size. Effect sizes grouped into the small team category included those that 
averaged less than 5 team members; the medium size category included 
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those effect sizes based on an average team size of 5 to 10 members; and 
the large team category included those effect sizes based on an average 
team size of more than 10 members. 

For small teams, a meta-analysis of 7 effect sizes, representing 178 
teams, indicated a mean true score correlation of .28. For medium-sized 
teams of 5 to 10 members, an analysis of 10 effect sizes representing 340 
teams resulted in a mean true score correlation of .27. Thus, for both small- 
and medium-sized teams there was a moderate effect of team building on 
team functioning. However, the results of a meta-analysis of team building 
for large teams, which consisted of 9 effect sizes (N = 61), suggested that 
the greatest impact of team building was upon teams that are large in size 
(p = .66). This rather large effect is in direct support of Hypothesis 4, which 
posited that the influence of team building would be most profound in large 
teams, as compared to small- or medium-sized teams. 


Discussion 

The present study was conducted to answer the question of whether 
team building works. The results are encouraging—they are suggestive of 
the idea that team building does improve team outcomes. Specifically, 
process and affective outcomes were most improved by team-building 
interventions. Moreover, all the components (i.e., role clarification, goal 
setting, interpersonal relations, and problem solving) of team building had 
a moderate effect on outcomes but the goal-setting and role-clarification 
components had the largest effect. Although teams of all sizes benefited 
from team building, large teams appeared to benefit the most. 

Despite the use of sophisticated meta-analytic techniques that were 
employed to assist in the examination of the four hypotheses that served as 
the focus for this research, our ability to definitively address the framing 
questions put forth and test for the hypothesized effects was dependent on 
the research available to us. In some instances, small subgroup effect sizes 
precluded fine-tuned assessment and the derivation of definitive conclu¬ 
sions. However, the findings from the current research were still instructive 
in many ways. As noted, the results from this study suggest that we are 
beginning to find some positive empirical support for the effectiveness of 
these commonly applied team-development strategies. 

What else have we learned? In addition to finding support for the effec¬ 
tiveness of team building in general, it was deemed equally important to 
understand whether the effectiveness of team building was moderated by 
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the specific outcomes that are targeted. Our second hypothesis posited that 
team-building interventions would result in improved outcomes across each 
of the four outcome types. This hypothesis was supported, with team build¬ 
ing shown to be effective for each of the outcome types examined. Of the 
four outcomes of interest, the results for cognitive outcomes were found to 
be the least robust. However, given the exceptionally small number of cor¬ 
relation coefficients that contributed to this subgroup analysis (i.e., k = 3), 
this finding may be properly viewed with considerable skepticism. The sec¬ 
ond part of this hypothesis was based upon prior theory and research in this 
area, and posited that team building would be most effective for improving 
affective outcomes. The findings indicated that team building is indeed 
highly effective for improving team member affective outcomes, but may 
be just as useful for facilitating improvements in team processes. Thus, it 
remains an open question as to which outcome is most greatly affected by 
team building—team processes or team member affective outcomes. 

Our next research inquiry led us to investigate whether the focus of team 
building moderated its effectiveness. First, it was found that each of the 
team-building components were useful for enhancing team functioning, 
with the estimated true score correlations ranging from .24 to .37. Similar 
to the Salas and colleagues’ (1999) meta-analysis, our data suggested that 
the role clarification component had the most impact on team outcomes. 
However, in contrast to Salas and colleagues, it appeared that the goal¬ 
setting component of team building was equally and perhaps more effective 
for improving team functioning as the role-clarification component. The 
results from this research also suggested that the interpersonal relations and 
problem-solving components were less effective. 

This research also examined whether team building is more effective for 
larger teams than smaller teams. Specifically, it was hypothesized that the 
effect sizes for small- and medium-sized teams would be weaker than those 
observed from team-building interventions conducted with large teams. 
Our prediction was supported—there was a substantial estimated true score 
correlation found for large teams and more modest results with teams that 
were classified as small or medium. One possible explanation for this result 
is the likely pre-intervention state of teams of various sizes. Although larger 
teams generally have a greater reservoir of available cognitive resources 
(Bantel & Jackson, 1989), it is also suggested that they are plagued by 
problems such as groupthink, the participation leadership effect, and the 
cohesiveness performance effect (e.g., Mullen et al., 1989). Given all the 
potential problems that can accompany larger teams, it is feasible that 
larger teams begin in a more negative state than smaller teams. Thus, there 
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may be less room for improvements from team building to manifest them¬ 
selves with teams of smaller size, which are typically more cohesive and 
less likely to experience these problems. 

In conclusion, this research extends the team-development literature by pro¬ 
viding an empirical assessment of the relationship between team building inter¬ 
ventions and outcomes. In addition, a number of potential moderators were also 
investigated. By extending the meta-analysis by Salas and colleagues (1999), 
more direction can be given to researchers who seek to further investigate spe¬ 
cific questions regarding the effectiveness of team building. However, the fact 
that our findings were not that consistent with those provided by Salas and col¬ 
leagues deserves a brief explanation. Specifically, how should we interpret the 
present findings in light of the contrasting findings of Salas and colleagues? 

Importantly, our database included approximately twice as many articles 
(20 vs. 11) and nearly four times as many correlations (60 vs. 16) as the pre¬ 
vious effort. In addition, although their study was published in 1999, the last 
included article in their database was published in 1990. In contrast, our data¬ 
base included seven sources put forward since 1990. The increased number of 
effect sizes included in our analyses should have resulted in more stable esti¬ 
mates of the relationships under investigation. It is also important to point out 
that 4 of the 11 articles included in the research by Salas and colleagues were 
excluded from our analyses. In one instance, we chose to include a published 
version of a dissertation included in the previous integration. In two other 
cases, we determined there was no real team-level intervention or team-level 
outcomes under investigation. Finally, for a fourth article, only individual- 
level outcomes were reported, making that study unusable for our purposes. 
These differences in meta-analytic databases, combined with the reality of 
using slightly more conservative meta-analysis procedures, may have led to 
the divergent findings between the two studies. Specifically, the current 
research used meta-analysis methods (i.e., Hunter & Schmidt, 1990; Hunter, 
Schmidt, & Jackson, 1982) that can be considered more conservative (Johnson 
et al., 1995) than the methods employed by Salas and colleagues (1999), 
whose data analyses were based on the methods employed by Rosenthal and 
colleagues (i.e., Rosenthal, 1991; Rosenthal & Rubin, 1988). 


Implications for Research and Practice 

What should academicians and human resources practitioners do when 
investigating or recommending team-building interventions? The answers 
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seem to lie in the examination of why a team requires a team-building inter¬ 
vention and the characteristics of the team. Human resources practitioners 
could play a more proactive role in identifying teams that could benefit 
from team building. Specifically, the finding that the role-clarification 
and goal-setting components improved performance over the other team¬ 
building components could benefit human resources practitioners and orga¬ 
nizational managers by providing increased clarity into ways in which 
leaders may best direct their teams (i.e., being clear about subordinates’ 
roles and setting goals). 

The results of the current meta-analysis provide encouraging news to 
human resource practitioners as well as to the many users of team-building 
interventions. Generally speaking, the data have suggested that team build¬ 
ing has a greater impact on some outcomes over others, and some team 
sizes over others. In a case where a manager suspects that his or her team 
may benefit from team building it would serve the manager to evaluate and 
identify the team’s characteristics, as well as the specific problems encoun¬ 
tered prior to intervening with team building. In other words, by communi¬ 
cating to practitioners the varied results that team-building interventions 
have on different outcomes, and considering the potential moderating influ¬ 
ence of team size, practitioners can be better prepared when assessing what 
type of intervention or change effort is most appropriate for their team. The 
results from this study reinforce the view that not all teams will benefit 
from the same team-building intervention. 


Study Limitations 

Despite the many interesting findings from the current research integra¬ 
tion, there are a number of limitations inherent in this study. First, one con¬ 
ceptual limitation of this study can be summed up by the potentially 
interesting and important moderator variables that we did not examine. 
These include team type, training duration, criterion source of measure¬ 
ment, and perhaps also a more exacting assessment of the level of interde¬ 
pendence of the teams under investigation in this integration. A proper 
analysis of these additional factors may represent a substantial contribution 
to the current knowledge base concerning team-building interventions. 

There were also a number of methodological limitations inherent to this 
research. For one thing, only a fraction of the primary studies reported suf¬ 
ficient data to allow for corrections of the unreliability of criterion 
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measurement. Depending on the analysis in question, only 10%-30% of the 
effect sizes in the database had an internal consistency estimate of reliabil¬ 
ity associated with it. Related to this issue, we took the liberty of assuming 
that the predictors, which in this case were the team-building interventions 
themselves, were implemented with 100% consistency and accuracy. 
Unfortunately, we were not aware of a better estimate of the reliability with 
which these interventions are characteristically implemented. Thus, our 
assumption and use of perfect predictor reliability is likely an overestimate 
of the true reliability of these interventions. 

An additional methodological concern in this research concerned the 
issue of having a relatively small number of correlations available for many 
of the subgroup analyses. Unfortunately, the strict criteria set for inclusion 
of primary studies in the database, combined with the relative paucity of 
published research in this area, somewhat limited the amount of data 
available to us. However, this issue has been relatively common in empiri¬ 
cal reviews of team building (e.g., Salas et al., 1999), and the current inte¬ 
gration still represents the most exhaustive statement currently available on 
this topic. 


Directions for Future Research and 
What We Still Need to Know 

This meta-analysis provides many answers that expand the literature on 
team-building interventions. However, there are still some questions that 
remain unanswered. Below we present a few recommendations for future 
research. Research is needed that examines the efficacy of team building for 
various types of teams. Especially needed are published studies of team¬ 
building interventions for management, and action teams. For example, in 
the strategic management literature it has been noted that top management 
teams are necessary partners to help set strategic direction, redesign organi¬ 
zation architecture, and improve business processes (Allaire, 1998). It would 
be well served for future studies to observe the impact that these interven¬ 
tions may have on organizational financial outcomes when implemented in 
management teams. Furthermore, future research could also examine if the 
reason by which teams are formed has any impact on the efficacy of a team¬ 
building intervention. If a team is formed because of a directive does team 
building affect it differently than a team that was formed by choice? 

Additionally, there is a need to further investigate the issue of team size 
as it relates to the efficacy of team building. It is an organizational reality 
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that organizations require teams of all shapes and sizes. The findings from 
this research, where team building appeared to work better for larger teams, 
were in direct contrast to those reported by Salas and colleagues (1999). 
Concerning the current findings, if team building does affect larger teams 
to a greater extent, does it do so only temporarily? Is there a greater need 
for follow-up development activities with large teams than small teams? 
Could it be that there is an interaction between team size and team type, 
such that team size is positively correlated with improvements in team 
functioning only for certain team types (e.g., production teams)? For other 
teams (e.g., project teams) it may be the case that team building works 
better with entities that are smaller in size. This issue could prove to be a 
fruitful area of inquiry, but is currently understudied. 

Lastly, very few of the studies analyzed here investigated the effects of 
team building over time. Instead, most were limited to an isolated postin¬ 
tervention measure of performance. Thus, there is a need for researchers to 
investigate the results of team building over the span of the team’s life. It is 
in this area that we feel practitioners could make the most impact. It is crit¬ 
ical to understand if teams that participate in team building work better as 
their time together increases. For example, the longer teams work together 
the more likely it is that problems will arise. Future research could assess 
if the techniques that teams use to resolve issues are based on what they 
learned from their team-building intervention. Ideally, such research would 
take pre-intervention measures of performance to calculate a baseline that 
would then be followed by multiple measures of performance after the 
team-building intervention. 


Concluding Remarks 

Our findings are encouraging—team building improves team outcomes; 
that these team-development interventions are beneficial to team functioning 
is the good news. However, we still need to know more about team building. 
What are the mechanisms by which it works? What specific features are best, 
and how can these be best designed and implemented? In short, more in- 
depth evaluations are needed. If our understanding of the effectiveness and 
boundary conditions of team building is to be further enhanced by the type of 
evidence-based conclusions derived from meta-analyses—again, we need 
more data and more evaluations. This is our call to practitioners of team¬ 
building interventions to lead the charge in assessing the impact that team 
building has on teams in the field. In practice, team building comes in many 
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forms (i.e., games, adventures, or exercises), and is a widely used interven¬ 
tion; this study is an attempt at explaining if it works. 
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